Inducing grammar from IGT

Authors

  • Lars Hellan
  • Dorothee Beermann
Abstract

We suggest a strategy for the incremental construction of deep parsing grammars from Interlinear Glossed Text (IGT). IGT is a representation format in which standard linguistics and NLP in principle meet, since it is a data type that is often available for digitally ‘less resourced languages’ (‘LRL’). The IGT database used is TypeCraft (Beermann and Mihaylov 2009, www.typecraft.org), and the grammar technology employed so far is that defined in the LKB system (Copestake 2002), an implementation of the HPSG grammar

Similar resources

From IGT to precision grammar: French verbal morphology

Interlinear glossed text (IGT, the familiar three-line format of linguistic examples) can be an extremely rich source of linguistic information, when linguists follow best practices in creating it (e.g., the Leipzig glossing rules, Comrie et al. 2003). The ODIN project (http://www.csufresno.edu/odin; Lewis 2006) recognized the value of IGT data as a reusable data type and has created a searchab...

Learning Grammar Specifications from IGT: A Case Study of Chintang

We present a case study of the methodology of using information extracted from interlinear glossed text (IGT) to create actual working HPSG grammar fragments using the Grammar Matrix, focusing on one language: Chintang. Though the results are barely measurable in terms of coverage over running text, they nonetheless provide a proof of concept. Our experience report reflects on the ways in whi...

Towards Creating Precision Grammars from Interlinear Glossed Text: Inferring Large-Scale Typological Properties

We propose to bring together two kinds of linguistic resources—interlinear glossed text (IGT) and a language-independent precision grammar resource—to automatically create precision grammars in the context of language documentation. This paper takes the first steps in that direction by extracting major-constituent word order and case system properties from IGT for a diverse sample of languages.

Increased adiponectin receptor-1 expression in adipose tissue of impaired glucose-tolerant obese subjects during weight loss.

OBJECTIVE: To investigate the mRNA expression of adiponectin, AdipoR1 and AdipoR2 (the two recently cloned adiponectin receptors), and peroxisome proliferator activated receptor (PPAR)gamma2 in adipose tissue of obese individuals before and during a very low calorie diet (VLCD) inducing weight loss. METHODS: Twenty-three non-diabetic obese subjects with normal (NGT, n = 11) or impaired glucose to...

Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix

Language CoLLAGE is a collection of grammatical descriptions developed in the context of a grammar engineering graduate course with the LinGO Grammar Matrix. These grammatical descriptions include testsuites in well-formed interlinear glossed text (IGT) format, high-level grammatical characterizations called ‘choices files’, HPSG grammar fragments (capable of parsing and generation), and docume...

Publication date: 2011